Spontaneous Emergence of Modularity in a Model of Evolving Individuals 
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We investigate the selective forces that promote the emergence of modularity in nature. We 
demonstrate the spontaneous emergence of modularity in a population of individuals that evolve 
in a changing environment. We show that the level of modularity correlates with the rapidity and 
severity of environmental change. The modularity arises as a synergistic response to the noise in the 
environment in the presence of horizontal gene transfer. We suggest that the hierarchical structure 
observed in the natural world may be a broken symmetry state, which generically results from 
evolution in a changing environment. 
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Modularity abounds in biology. Elements of 
hierarchy — modules — are found in developmental biol- 
ogy, evolutionary biology, and ecology [J 0, El- Mod- 
ularity is observed at levels that span molecules, cells, 
tissues, organs, organisms, and societies. At the genomic 
level, there are introns, exons, chromosomes, and genes. 
Moreover, there are mechanisms to rearrange and trans- 
mit the information that is modularly encoded at the ge- 
nomic level, such as gene duplication, transposition, and 
horizontal gene transfer 0, @] . We define a module to be 
a component that can operate relatively independently of 
the rest of the system. From a structural perspective, ex- 
istence of modularity means there are more intra-modulc 
connections than inter-module connections. From a func- 
tional perspective, a module is a unit that can perform 
largely the same function in different contexts. Modular- 
ity has been characterized in a variety of network systems 
by physical methods @, 0] ■ Selection for stability, for ex- 
ample, has been shown to select for modular networks Q. 
A dictionary of constituent parts, or network motifs, has 
been identified for the transcriptional regulation network 
of E. coli 0]. And once modularity has arisen, so that 
the goals a species face become modular, modularly vary- 
ing goals have been shown to select for modular structure 
[lfj. Horizontal gene transfer has been suggested to be 
essential to the evolution of a universal genetic code 11 1 . 

How does modularity arise a priori in nature? It has 
been suggested that by being modular, a system will tend 
to be both more robust to perturbations and more evolv- 
able [3, [3, 14 1 . It has further been suggested that 



there is a selective pressure for positive evolvability in 
a population of individuals in a changing environment 
hj. Thus, we have hypothesized that modularity arises 
spontaneously from the generic requirement that a popu- 
lation of individuals in a changing environment be evolv- 
able 11611. Support for this hypothesis has to date been 
elusive |l7j|. 

In this Letter, we show the hypothesis of spontaneous 
evolution of hierarchy in a system under changing envi- 
ronmental conditions to be valid. Specifically, we show 
that in the presence of horizontal gene transfer, envi- 



ronmental change leads to the spontaneous emergence of 
modularity in a generic model of a population of evolv- 
ing individuals. To represent the replication rate, or mi- 
croscopic fitness, of the individuals, we use a spin glass 
model that has proved useful in previous studies of evo- 
lution 01, 0, 



2fJ. To be specific, we choose parameter 
values appropriate to describe a population of evolving 
proteins [ll El, El HH . Spontaneous emergence of mod- 
ularity, however, generically occurs for a population of 
evolving individuals and depends only on the presence of 
a changing environment and the presence of horizontal 
gene transfer. This spin glass model is appropriate be- 
cause it provides a rugged, difficult landscape upon which 
evolution struggles to occur, and so there can be a pres- 
sure for more efficient evolutionary structures to arise. 
There are three time scales in our system: the fastest 
time scale of sequence evolution of the individuals in the 
population, the intermediate time scale of environmen- 
tal change, and the longest time scale of the change to 
the structure of protein fold space. The symmetry of 
a uniformly random structure is broken by the sponta- 
neous emergence of modular structure as a response to 
environmental change. 

We use the following spin glass form for the microscopic 
fitness of proteins in our system (for a discussion on the 
spin glass approach to evolution, see [IB, 19|, 2(| 2l[). 
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where s"' k is the amino acid identity of the sequence a, k 
within fold a at position i, and N = 120 is the length 
of the protein sequence. We consider the amino acids to 
lie within 5 classes (l9j . The term <7^ j(sj, Sj), is the in- 
teraction matrix, symmetric in i and j, whose elements 
are each taken from a Gaussian distribution with zero 
mean and unit variance. It differs for each i, j, S4, and 
Sj. The effect of the environment is encoded by these 
random couplings. When the environment changes with 
severity p, each of the couplings is with probability p 
randomly redrawn from the Gaussian distribution. The 
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term Afj defines the protein fold, i.e. the contact ma- 
trix, or connections in structure, for fold a. The matrix 
is symmetric, with elements or 1. In order to guarantee 
that the emergence of modularity comes from redistribu- 
tion of connections rather than an increase in the number 
of connections, we constrain J2i>j+i = Nd = 346. 
Any value of Np such that the connection matrix is nei- 
ther all unity nor all zero would give qualitatively similar 
results. We take A% = and A£ i±1 = 1. 

Because horizontal gene transfer will be assumed to 
transfer any of the 12 blocks of length 10 in the se- 
quence, modularity is defined by the number of connec- 
tions within the 12 10 x 10 blocks along the diagonal 
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so that i, j are within the 1 + k th diagonal block of size 
10. Even a random distribution of contacts will have a 
non-zero absolute modularity, M , and so it is the ex- 
cess modularity that measures the degree of spontaneous 
symmetry breaking, 6M a = M a — M . Emergence of 
modularity means that as a result of evolution, connec- 
tions in structure are not evenly distributed between po- 
sitions. The interactions are greater in the local, diagonal 
blocks than in the rest of the matrix, and so SM a > 0. 

In order to see the emergence of modularity, we need a 
set of individuals in a changing environment. Moreover, 
we need a population of these sets, each set with a differ- 
ent •. We take the population size to be D SiZC = 300 
different structures, 1 < a < D sizc , and each given struc- 
ture has a set of iV sizc = 1000 different sequences, 1 < k < 
N s - lze , associated with them. The average excess modu- 
larity is given by 5M = M— M = £a=i e M a - M . 

The structures, Af^, are initialized by first randomly 
generating one such structure with No = 346 and a cer- 
tain M. We then obtain the full set of D slze structures 
by mutation away from this structure. Two elements of 
Afj with opposite status are randomly chosen, and the 
status of each is flipped from 1 — ► 0, — ► 1. These mu- 
tations are done n times, where n is a Poisson random 
number with mean 2. The sequences, s"' k , 1 < i < N, of 
each individual arc initialized by random assignment. 

The evolution in our simulation involves three levels 
of change. The most rapid change occurs by evolution 
of the sequences through point mutation and gene seg- 
ment swapping. For each structure Af at each round, 
all the iVsize associated sequences undergo point muta- 
tion, gene segment swapping, and selection. The Pois- 
son point mutation process changes on average 2.4 amino 
acids per sequence, which are randomly selected and as- 
signed a random class. In gene segment swapping two 
randomly selected sequences from the population associ- 
ated with one structure attempt to exchange each of the 
12 sequence fragments between 10A; + 1 and lOfc + 10 (of 
length 10) with probability 0.1. The qualitative behav- 



ior of the results does not depend on the exact mutation 
rates. Pairs of sequences in the population associated 
with one structure are chosen, until all sequences have 
been chosen. This process is a model of horizontal gene 
transfer and recombination. The 50% sequences with 
the lowest energy are selected and randomly duplicated 
to recover the population of N s - lzc for the next round; 
the microscopic replication rate, or fitness, for sequence 
a, k in structure a is r a (s a > k ) = 26[H§, 2 - H a (s a ' k )], 
where 6{x) is the Heavyside step function. Mutation and 
selection are repeated T2 rounds. 

The next most rapid change is that of the environment, 
which occurs with severity p and frequency I/T2. During 
the environmental change, the elements of the interaction 
matrix er, change with probability p. 

The slowest level of change is the structural evolution. 
The selection at this level is based on the cumulative fit- 
ness of the set of individuals with a given structure, aver- 
aged over T3 = 10 4 T2 environmental changes. The struc- 
tures with the best 5% cumulative fitness are selected 
and randomly amplified to make the new population of 
-Dgizc structures, Ag. The structure population also un- 
dergoes mutation. As with the initial construction, two 
elements of A™j with opposite status are randomly cho- 
sen, and the status of each is flipped from 1 — ► 0, — ► 1. 
These mutations are done n times, where n is a Poisson 
random number with mean 2. The mutated structures, 
Afj , are used for the next T 3 rounds of evolution. 

In Fig. [1] we show the spontaneous emergence of mod- 
ularity from the symmetric, random state of no excess 
modularity, M = Mq = 22. Since the system is initially 
quite far from the steady state modularity, the growth of 
the excess modularity with time is roughly linear. The 
excess modularity is the order parameter for this system, 
and its growth shows that the system is in a broken sym- 
metry phase with modular structure under these condi- 
tions. Interestingly, the growth of modularity is identical 
for an initial contact matrix that is power-law distributed 
with 7 = 3. Figure [1] shows that emergence of modularity 
in this model requires both horizontal gene transfer and 
a changing environment. The spontaneous emergence of 
modularity is a general result. In Fig. [H we show the 
excess modularity still grows, even if the gene transfer 
starts at a uniformly random position and swaps a ran- 
dom length of sequence. The original assumption of fixed 
length and position, however, is biologically motivated. 
If the blocks are exons, and the ratio of non-coding to 
coding DNA is large, then typical recombination or hor- 
izontal gene transfer will transfer an integer number of 
complete blocks, which is our horizontal gene transfer 
operator of fixed length and position. 

The system adopts the broken-symmetry, modular 
state not because the point mutation and gene segment 
swapping moves favor modularity a priori, but rather be- 
cause these moves enable the system to respond more 
effectively to a changing environment when the system 
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FIG. 1: Spontaneous emergence of excess modularity, M > 
A/o = 22 from a state with no excess modularity, M = Mo. 
The random, symmetric distribution of structural connections 
is spontaneously broken as system evolves. Here T2 = 20, 
and the severity of environmental change is p = 0.40. The 
upper left inset shows the growth of modularity starting from 
a power law distributed contact matrix (7 = 3). The up- 
per middle inset shows the improvement in the energy as the 
modularity grows. The upper right inset shows the improve- 
ment of evolvability, or change in energy in one environment, 
as the modularity grows. The inset in the middle row shows 
the emergence of modularity as a result of a horizontal gene 
transfer operator with a Poisson random swap length and uni- 
form random starting position. Shown are data for an average 
swap length of 10 (O), 20 (□), 20 (0), 5 (A), and 40 (v) with 
12, 6, 12, 24, and 3 attempted swaps, respectively, of proba- 
bility 0.1 per sequence pair. The lower left inset shows how 
the energy changes within one environment and between en- 
vironmental changes (T2 = 20). The lower right inset shows 
that emergence of modularity requires both environmental 
change and horizontal gene transfer. In all cases modularity 
is measured by Eq. [2] and excess modularity is measured by 
6M a = M a - M , with M = 22. 



FIG. 2: The velocity at which modularity grows is positively 
correlated with the magnitude of environment change, p. The 
frequency of environment change is set at I/T2 = 1/40. The 
inset shows the response function of the system dM/d(t/Tz) 
as a function of the severity of environmental change. 
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is modular. That is, evolvability is implicitly selected for 
in a changing environment, and gene segment swapping 
enhances evolvability if the system is modular. Thus, 
we expect modularity to be implicitly selected for in a 
changing environment in the presence of horizontal gene 
transfer, with the degree of modularity correlated to the 
degree of environmental change. In Fig. [2] we show the 
change of modularity with time for different severities of 
environmental change, p. For this figure, we choose the 
initial set of structures from an ensemble with M = 147, 
rather than M = Mo, to show the change of modularity 
more clearly. For no environmental change, the mod- 
ularity decreases from this high level. But for positive 
environmental change, the modularity increases from the 
initial, high level. The velocity of the increase is larger 
for greater environmental change. 

Another way of characterizing the environmental 
change is by the frequency of change, and the emer- 



FIG. 3: Frequency of environmental change also affects the 
time evolution of spontaneous modularity. The severity of 
environment change is p = 0.40. The upper inset shows the 
growth of modularity versus real time rather than versus time 
relative to the number of environmental changes. The lower 
inset shows the response function of the system dM/d(t/10°) 
as a function of the frequency of environmental change. 

gence of modularity depends on this parameter as well. 
In Fig. [3] we show the growth of modularity with time for 
different frequencies of environmental change. For fre- 
quencies of environmental change that are not too large, 
the modularity increases with frequency. For very high 
frequencies, I/T2 > 1/5, the system is unable to track the 
changes in the environment, and the modularity decays 
with frequency. The velocity of modularity increase in 
Fig. [3] for p = 0.40 and T 2 = 20 is less than that in Fig. [TJ 
because in Fig.[31 the system is closer to the steady-state, 
broken-symmetry value than it is in the Fig. [TJ 
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FIG. 4: If the initial value of the modularity is greater than 
the steady-state value, the modularity decays with time. Here 
T2 = 20, and the severity of environment change is p = 0.40. 



The spontaneous emergence of modularity is caused by 
the historical variation in environments that the system 
has encountered. By a fluctuation-dissipation argument 
[lil . I22I [23[, we might expect that the degree of mod- 
ularity should be proportional to the variance of envi- 
ronments encountered. In the inset to Fig. [2] we show 
that the velocity of the increase in modularity is roughly 
proportional to the severity of environmental change, p. 
In the inset to Fig. [3] we show that the velocity of the 
increase in modularity is roughly proportional to the fre- 
quency of environmental change, I/T2. 

While the modularity grows with time in Figs. [TH3] for 
p > and T2 > 5, at steady state the system will be only 
partially modular, M < = 346, reflecting a balance 
between the selection for modularity in a changing en- 
vironment and the mutations driving the system toward 
the symmetric state of no excess modularity. Sec Fig. 2] 
The excess modularity in the broken symmetry state is 
positive because of selection for modularity in fluctuat- 
ing environments, and the excess modularity is not the 
maximal possible value of M = Nd = 346 because of the 
entropic effects of the mutations in the sequence space. 
For the initial condition used in Fig. [31 nearly all the con- 
nections in the diagonal blocks and few in the off-diagonal 
blocks, modularity decays over time, showing the steady 
state value is below 316. The modularity will saturate 
at a value for which the effects of selection pressure and 
mutation balance each other. 

Further experimental study of the relation between 
large scale genetic exchange and the promotion of mod- 
ularity is warranted Q. Some species of yeast may un- 
dergo either sexual or asexual reproduction, and exper- 
iments suggest that yeasts undergoing sexual reproduc- 
tion arc more evolvable 



sion networks [25j , in the laboratory. At an applied level, 
we note that the process by which antibiotics resistance 
evolved makes use of the modular structure of the 
genes encoding the enzymes that degrade and the pumps 
that excrete antibiotics and the modular structure of the 
proteins to which antibiotics bind (27j . 

Why is modularity so prevalent in the natural world? 
Our hypothesis is that a changing environment selects for 
adaptable frameworks, and competition among different 
evolutionary frameworks leads to selection of structures 
with the most efficient dynamics, which are the modular 
ones. We have provided evidence validating this hypoth- 
esis. Wc suggest that the beautiful, intricate, and inter- 
related structures observed in nature may be the generic 
result of evolution in a changing environment. The exis- 
tence of such structure need not necessarily rest on intel- 
ligent design or the anthropic principle. 



24j. It would be interesting to 



construct protocols to study the relation between sexual 
recombination and modularity, possibly in gene exprcs- 
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